OCR exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Complex documents images segmentation based on steerable pyramid features

Internal identifier: 000135 (France/Analysis); previous: 000134; next: 000136

Authors: Mohamed Benjelil [Tunisia]; Slim Kanoun [Tunisia]; Rémy Mullot [France]; Adel Alimi [Tunisia]

RBID : Hal:hal-00495728

English descriptors: Complex document segmentation; Multi-resolution analysis; Steerable pyramid; Invariant features

Abstract

Page segmentation and classification are essential steps in a document layout analysis system, performed before a document is passed to an OCR system or to any other subsequent processing step. In this paper, we propose an accurate system designed for complex document segmentation. The system is based on the steerable pyramid transform. Features extracted from the pyramid sub-bands are used to locate regions and classify them as text (machine-printed or handwritten) or non-text (images, graphics, drawings, or paintings) in noisy, deformed, multilingual, multi-script document images. These documents contain tabular structures, logos, stamps, handwritten script blocks, photographs, etc. We present the encouraging results obtained on a data set of 1,000 official complex document images and compare them with those of existing state-of-the-art methods. The comparison shows that the proposed method performs consistently well on large sets of complex document images.

URL: https://hal.archives-ouvertes.fr/hal-00495728
DOI: 10.1007/s10032-010-0113-9


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Complex documents images segmentation based on steerable pyramid features</title>
<author>
<name sortKey="Benjelil, Mohamed" sort="Benjelil, Mohamed" uniqKey="Benjelil M" first="Mohamed" last="Benjelil">Mohamed Benjelil</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-210908" status="VALID">
<orgName>REsearch Group in Intelligent Machines</orgName>
<orgName type="acronym">REGIM</orgName>
<desc>
<address>
<country key="TN"></country>
</address>
<ref type="url">http://regim.org/</ref>
</desc>
<listRelation>
<relation active="#struct-301282" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301282" type="direct">
<org type="institution" xml:id="struct-301282" status="VALID">
<orgName>École Nationale d'Ingénieurs de Sfax [Sfax]</orgName>
<orgName type="acronym">ENIS</orgName>
<desc>
<address>
<addrLine>Dépt. G.E, (ENIS), B.P. 1173, 3038 Sfax</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.enis.rnu.tn/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
<author>
<name sortKey="Kanoun, Slim" sort="Kanoun, Slim" uniqKey="Kanoun S" first="Slim" last="Kanoun">Slim Kanoun</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-210908" status="VALID">
<orgName>REsearch Group in Intelligent Machines</orgName>
<orgName type="acronym">REGIM</orgName>
<desc>
<address>
<country key="TN"></country>
</address>
<ref type="url">http://regim.org/</ref>
</desc>
<listRelation>
<relation active="#struct-301282" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301282" type="direct">
<org type="institution" xml:id="struct-301282" status="VALID">
<orgName>École Nationale d'Ingénieurs de Sfax [Sfax]</orgName>
<orgName type="acronym">ENIS</orgName>
<desc>
<address>
<addrLine>Dépt. G.E, (ENIS), B.P. 1173, 3038 Sfax</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.enis.rnu.tn/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
<author>
<name sortKey="Mullot, Remy" sort="Mullot, Remy" uniqKey="Mullot R" first="Rémy" last="Mullot">Rémy Mullot</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Alimi, Adel" sort="Alimi, Adel" uniqKey="Alimi A" first="Adel" last="Alimi">Adel Alimi</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-210908" status="VALID">
<orgName>REsearch Group in Intelligent Machines</orgName>
<orgName type="acronym">REGIM</orgName>
<desc>
<address>
<country key="TN"></country>
</address>
<ref type="url">http://regim.org/</ref>
</desc>
<listRelation>
<relation active="#struct-301282" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301282" type="direct">
<org type="institution" xml:id="struct-301282" status="VALID">
<orgName>École Nationale d'Ingénieurs de Sfax [Sfax]</orgName>
<orgName type="acronym">ENIS</orgName>
<desc>
<address>
<addrLine>Dépt. G.E, (ENIS), B.P. 1173, 3038 Sfax</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.enis.rnu.tn/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-00495728</idno>
<idno type="halId">hal-00495728</idno>
<idno type="halUri">https://hal.archives-ouvertes.fr/hal-00495728</idno>
<idno type="url">https://hal.archives-ouvertes.fr/hal-00495728</idno>
<idno type="doi">10.1007/s10032-010-0113-9</idno>
<date when="2010">2010</date>
<idno type="wicri:Area/Hal/Corpus">000034</idno>
<idno type="wicri:Area/Hal/Curation">000034</idno>
<idno type="wicri:Area/Hal/Checkpoint">000113</idno>
<idno type="wicri:Area/Main/Merge">000602</idno>
<idno type="wicri:Area/Main/Curation">000597</idno>
<idno type="wicri:Area/Main/Exploration">000597</idno>
<idno type="wicri:Area/France/Extraction">000135</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">Complex documents images segmentation based on steerable pyramid features</title>
<author>
<name sortKey="Benjelil, Mohamed" sort="Benjelil, Mohamed" uniqKey="Benjelil M" first="Mohamed" last="Benjelil">Mohamed Benjelil</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-210908" status="VALID">
<orgName>REsearch Group in Intelligent Machines</orgName>
<orgName type="acronym">REGIM</orgName>
<desc>
<address>
<country key="TN"></country>
</address>
<ref type="url">http://regim.org/</ref>
</desc>
<listRelation>
<relation active="#struct-301282" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301282" type="direct">
<org type="institution" xml:id="struct-301282" status="VALID">
<orgName>École Nationale d'Ingénieurs de Sfax [Sfax]</orgName>
<orgName type="acronym">ENIS</orgName>
<desc>
<address>
<addrLine>Dépt. G.E, (ENIS), B.P. 1173, 3038 Sfax</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.enis.rnu.tn/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
<author>
<name sortKey="Kanoun, Slim" sort="Kanoun, Slim" uniqKey="Kanoun S" first="Slim" last="Kanoun">Slim Kanoun</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-210908" status="VALID">
<orgName>REsearch Group in Intelligent Machines</orgName>
<orgName type="acronym">REGIM</orgName>
<desc>
<address>
<country key="TN"></country>
</address>
<ref type="url">http://regim.org/</ref>
</desc>
<listRelation>
<relation active="#struct-301282" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301282" type="direct">
<org type="institution" xml:id="struct-301282" status="VALID">
<orgName>École Nationale d'Ingénieurs de Sfax [Sfax]</orgName>
<orgName type="acronym">ENIS</orgName>
<desc>
<address>
<addrLine>Dépt. G.E, (ENIS), B.P. 1173, 3038 Sfax</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.enis.rnu.tn/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
<author>
<name sortKey="Mullot, Remy" sort="Mullot, Remy" uniqKey="Mullot R" first="Rémy" last="Mullot">Rémy Mullot</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-40831" status="VALID">
<orgName>Laboratoire Informatique, Image et Interaction</orgName>
<orgName type="acronym">L3I</orgName>
<desc>
<address>
<addrLine>Bâtiment Pascal Avenue Michel Crépeau F-17042 La Rochelle Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-lr.fr/l3i</ref>
</desc>
<listRelation>
<relation name="EA2118" active="#struct-300311" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA2118" active="#struct-300311" type="direct">
<org type="institution" xml:id="struct-300311" status="VALID">
<orgName>Université de La Rochelle</orgName>
<desc>
<address>
<country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">La Rochelle</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de La Rochelle</orgName>
</affiliation>
</author>
<author>
<name sortKey="Alimi, Adel" sort="Alimi, Adel" uniqKey="Alimi A" first="Adel" last="Alimi">Adel Alimi</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-210908" status="VALID">
<orgName>REsearch Group in Intelligent Machines</orgName>
<orgName type="acronym">REGIM</orgName>
<desc>
<address>
<country key="TN"></country>
</address>
<ref type="url">http://regim.org/</ref>
</desc>
<listRelation>
<relation active="#struct-301282" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle active="#struct-301282" type="direct">
<org type="institution" xml:id="struct-301282" status="VALID">
<orgName>École Nationale d'Ingénieurs de Sfax [Sfax]</orgName>
<orgName type="acronym">ENIS</orgName>
<desc>
<address>
<addrLine>Dépt. G.E, (ENIS), B.P. 1173, 3038 Sfax</addrLine>
<country key="TN"></country>
</address>
<ref type="url">http://www.enis.rnu.tn/</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>Tunisie</country>
</affiliation>
</author>
</analytic>
<idno type="DOI">10.1007/s10032-010-0113-9</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Complex document segmentation</term>
<term>Multi-resolution analysis</term>
<term>Steerable pyramid</term>
<term>invariant features</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Page segmentation and classification is very important in document layout analysis system before it is presented to an OCR system or for any other subsequent processing steps. In this paper, we propose an accurate and suitably designed system for complex documents segmentation. This system is based on steerable pyramid transform. The features extracted from pyramid sub-bands serve to locate and classify regions into text (either machine-printed or handwritten) and non-text (images, graphics, drawings or paintings) in some noise-infected, deformed, multilingual, multi-script document images. These documents contain tabular structures, logos, stamps, handwritten script blocks, photographs, etc. The encouraging and promising results obtained on 1,000 official complex document images data set are presented in this research paper. We compared our results with those from existing state-of-the-art methods. This comparison shows that the proposed method performs consistently well on large sets of complex document images.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Tunisie</li>
</country>
<region>
<li>Poitou-Charentes</li>
</region>
<settlement>
<li>La Rochelle</li>
</settlement>
<orgName>
<li>Université de La Rochelle</li>
</orgName>
</list>
<tree>
<country name="Tunisie">
<noRegion>
<name sortKey="Benjelil, Mohamed" sort="Benjelil, Mohamed" uniqKey="Benjelil M" first="Mohamed" last="Benjelil">Mohamed Benjelil</name>
</noRegion>
<name sortKey="Alimi, Adel" sort="Alimi, Adel" uniqKey="Alimi A" first="Adel" last="Alimi">Adel Alimi</name>
<name sortKey="Kanoun, Slim" sort="Kanoun, Slim" uniqKey="Kanoun S" first="Slim" last="Kanoun">Slim Kanoun</name>
</country>
<country name="France">
<region name="Poitou-Charentes">
<name sortKey="Mullot, Remy" sort="Mullot, Remy" uniqKey="Mullot R" first="Rémy" last="Mullot">Rémy Mullot</name>
</region>
</country>
</tree>
</affiliations>
</record>
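
The `<affiliations>` block at the end of the record above carries no namespace prefixes, so its `<tree>` element can be read with a standard XML parser. The following is a minimal sketch, not part of Dilib or Wicri, that groups author names by country. The function name `authors_by_country` and the abbreviated inline fragment (most attributes trimmed) are assumptions made for illustration. Note that the full record as shown uses `hal:` and `wicri:` prefixes without visible namespace declarations, so a strict parser would likely reject it unless those prefixes are declared first.

```python
import xml.etree.ElementTree as ET

# Abbreviated copy of the <tree> element from the record above.
# Only the sortKey attribute is kept on <name>; this trimming is an
# assumption made to keep the example short.
FRAGMENT = """
<tree>
  <country name="Tunisie">
    <noRegion>
      <name sortKey="Benjelil, Mohamed">Mohamed Benjelil</name>
    </noRegion>
    <name sortKey="Alimi, Adel">Adel Alimi</name>
    <name sortKey="Kanoun, Slim">Slim Kanoun</name>
  </country>
  <country name="France">
    <region name="Poitou-Charentes">
      <name sortKey="Mullot, Remy">Rémy Mullot</name>
    </region>
  </country>
</tree>
"""


def authors_by_country(xml_text):
    """Return {country name: [author names]} in document order."""
    root = ET.fromstring(xml_text)
    result = {}
    for country in root.iter("country"):
        # iter() walks all descendants, so names nested under
        # <noRegion> or <region> are collected as well.
        result[country.get("name")] = [n.text for n in country.iter("name")]
    return result


if __name__ == "__main__":
    for country, names in authors_by_country(FRAGMENT).items():
        print(country, names)
```

The country names are returned exactly as stored in the record (for example `Tunisie`), since they act as keys in the Wicri data rather than as display text.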

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/France/Analysis
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000135 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/France/Analysis/biblio.hfd -nk 000135 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    France
   |étape=   Analysis
   |type=    RBID
   |clé=     Hal:hal-00495728
   |texte=   Complex documents images segmentation based on steerable pyramid features
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024